Evaluation of General Set Expressions

نویسندگان

  • Ehsan Chiniforooshan
  • Arash Farzan
  • Mehdi Mirzazadeh
چکیده

We consider the problem of evaluating an expression over sets. The sets are preprocessed and are therefore sorted, and the operators can be any of union, intersection, difference, complement, and symmetric difference (exclusive union). Given the expression as a formula and the sizes of the input sets, we are interested in the worst-case complexity of evaluation (in terms of the size of the sets). The problem is motivated by document retrieval in search engines where a user query translates directly to an expression over the sets containing the userentered words. Special cases of of this problem have been studied [9,7] where the expression has a restricted form. In this paper, we present an efficient algorithm to evaluate the most general form of a set expression. We show a lower bound on this problem for expressions of the form E1, or E1 − E2 where E1 and E2 are expressions with union, intersection, and symmetric difference operators. We demonstrate that the algorithm’s complexity matches the lower bound in these instances. We, moreover, conjecture that the algorithm works optimally, even when we allow difference and complement operations in E1 and E2.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Investigating Embedded Question Reuse in Question Answering

The investigation presented in this paper is a novel method in question answering (QA) that enables a QA system to gain performance through reuse of information in the answer to one question to answer another related question. Our analysis shows that a pair of question in a general open domain QA can have embedding relation through their mentions of noun phrase expressions. We present methods f...

متن کامل

An Empirical Evaluation and Analysis of the Fault-Detection Capability of MUMCUT for General Boolean Expressions

Boolean expressions are extensively used in software specifications. It is important to generate a small-sized test set for Boolean expressions without sacrificing the fault-detection capability. MUMCUT is an efficient test case generation strategy for Boolean expressions in Irredundant Disjunctive Normal Form (IDNF). In the real world, however, Boolean expressions written by a software designe...

متن کامل

Incremental Evaluation of Nest and Unnest Operators in Nested Relations

Incremental expressions are important in materialized view maintenance, query optimization, and constraint checking. In this paper, we derive incremental expressions for the nest and unnest operators in nested relations. The expressions are derived for both general nested relations using set operators for updating relations and for partition-normal form (PNF) relations using extended set operat...

متن کامل

Language and the Socio-Cultural Worlds of Those Who Use it: A Case of Vague Expressions

 The present study is an attempt to investigate the use of vague expressions by intermediate EFL learners. More specifically, the current study focuses on the structures and functions of one of the most common categories of vague language, i.e. general extenders. The data include a 22-hour corpus of English-as-a-foreign-language conversations. A comparison is also made between this corpus and a...

متن کامل

Pragmatic expressions in cross-linguistic perspective

This  paper  focuses  on  some  pragmatic  expressions  that  are  characteristic  of  informal  spoken English, their possible equivalents in some other languages, and their use by EFL learners from different  backgrounds.  These  expressions,  called  general  extenders  (e.g.  and  stuff,  or something), are shown to be different from discourse markers and to exhibit variation in form, funct...

متن کامل

Reliability assessment of power distribution systems using disjoint path-set algorithm

Finding the reliability expression of different substation configurations can help design a distribution system with the best overall reliability. This paper presents a computerized a nd implemented algorithm, based on Disjoint Sum of Product (DSOP) algorithm. The algorithm was synthesized and applied for the first time to the determination of reliability expression of a substation to determine...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008